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ABSTRACT 

Type II DNA topoisomerases are divided into two fam- 
ilies, MA and MB. Types MA and MB enzymes share 
homologous B subunits encompassing the ATP- 
binding site, but have non-homologous A subunits 
catalyzing DNA cleavage. Type MA topoisomerases 
are ubiquitous in Bacteria and Eukarya, whereas 
members of the MB family are mostly present in Ar- 
chaea and plants. Here, we report the detection of 
genes encoding type MB enzymes in which the A and 
B subunits are fused into a single polypeptide. These 
proteins are encoded in several bacterial genomes, 
two bacterial plasmids and one archaeal plasmid. 
They form a monophyletic group that is very diver- 
gent from archaeal and eukaryotic type MB enzymes 
(DNA topoisomerase VI). We propose to classify them 
into a new subfamily, denoted DNA topoisomerase 
VIM. Bacterial genes encoding a topoisomerase VIM 
are present within integrated mobile elements, most 
likely derived from conjugative plasmids. Purified 
topoisomerase VIII encoded by the plasmid pPPMIa 
from Paenibacillus polymyxa Ml had ATP-dependent 
relaxation and decatenation activities. In contrast, 
the enzyme encoded by mobile elements integrated 
into the genome of Ammonifex degensii exhibited 
DNA cleavage activity producing a full-length linear 
plasmid and that from Microscilla marina exhibited 
ATP-independent relaxation activity.Topoisomerases 
VIM, the smallest known type MB enzymes, could be 
new promising models for structural and mechanis- 
tic studies. 



INTRODUCTION 

DNA topoisomerases are essential for solving topological 
problems arising during DNA metabolic processes and are 
therefore critical for the preservation of genome stability 
(1-3). They can interconvert different topological forms of 
DNA and either catenate or decatenate DNA rings by gen- 
erating transient single- (type I topoisomerases) or double- 
stranded (type II topoisomerases) DNA breaks in DNA 
backbones. Type II enzymes are especially important be- 
cause of their unique capacity to catalyze the transfer of 
one DNA duplex through another. They are essential for 
resolving catenanes generated between daughter chromo- 
somes at the end of DNA replication and also play a ma- 
jor role in relaxing positive supercoils generated by track- 
ing processes occurring during transcription and replica- 
tion. Accordingly, all cellular organisms, without exception, 
have at least one type II enzyme. 

Type II topoisomerases are classified into two families, 
IIA and IIB, based on sequence and structural similarities 
(4-7). Archaeal and bacterial types IIA and IIB enzymes are 
heterotetramers composed of two different subunits (A and 
B), whereas type IIA enzymes from eukaryotes and their 
viruses are homodimers, with the B and A moieties fused 
into a single polypeptide. Several high-resolution structures 
of both types IIA and IIB enzymes have been solved, high- 
lighting some structural similarities, but also large differ- 
ences between these two families (8-11). The B subunits 
are homologous in the two families and contain a similar 
ATP-binding site located within a protein domain known as 
the Bergerat fold, which is characteristic of proteins of the 
GHKL (Gyrase, Hsp90, histidine Kinase, MutL) superfam- 
ily (4,12). Moreover, enzymes from the two families share 
two other functional domains: the Toprim domain display- 
ing the Rossmann fold (involved in magnesium binding) and 
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the Winged Helix Domain (WHD), which contains the ac- 
tive site tyrosine. However, the Toprim domain of type IIA 
topoisomerases is located within the B subunit, whereas in 
the TIB family, this domain is located within the A subunit. 
Furthermore, their WHDs are unrelated, despite the pres- 
ence of the catalytic tyrosine in both cases (Figure 1). Over- 
all, the A subunits of types IIA and IIB topoisomerases 
share neither sequence nor structural similarity. This sug- 
gests that these two enzyme families originated indepen- 
dently via the association of homologous B subunits with 
non homologous A subunits (5). 

The origin and evolution of topoisomerases are rather 
puzzling. Ribosomal proteins exist in three distinct ver- 
sions, corresponding to the three cellular domains, Archaea, 
Bacteria and Eukarya (13,14); however, each topoisomerase 
family exists in several versions (hereafter called subfam- 
ilies) that do not overlap with the three domains in phy- 
logenomic analyses (5-7,15). In addition, several groups of 
viruses encode topoisomerases that are not related to those 
of their hosts but instead form monophyletic groups that 
branch in between cellular domains in phylogenetic analy- 
ses (5,7,15). 

Several distinct subfamilies of type IIA enzymes are 
known, which differ in catalytic activity and/or quater- 
nary structure. These include (i) DNA gyrases, present 
in all bacteria, in some members of the archaeal phylum 
Euryarchaeota and in eukaryotes with endosymbionts of 
cyanobacterial origin; (ii) topoisomerase IV enzymes, spe- 
cific to bacteria; (iii) eukaryotic type II topoisomerases, 
present in all eukaryotic species; (iv) viral type II topoiso- 
merases encoded by some Nucleocytoplasmic Large DNA 
Viruses (NCLDV) that infect eukaryotes; and (v) viral type 
II topoisomerases encoded by T4-like myoviruses that infect 
bacteria. 

The distribution of type IIB topoisomerases is more lim- 
ited than that of type IIA. This family includes archaeal 
type IIB enzymes, also called topoisomerase VI, with close 
homologs in Archaeplastida (plants, red and green algae), 
some protists and a few bacteria (7,15). Topoisomerases VI 
are relaxing enzymes devoid of gyrase activity (16), are the 
only type II topoisomerases present in most archaea, sug- 
gesting that they are critical for chromosome segregation. 
Topoisomerases VI are also the only enzymes capable of re- 
laxing positive supercoils in two of the three major archaeal 
phyla (17). Thus, these enzymes are also essential for man- 
aging the waves of positive supercoiling generated during 
DNA replication and transcription. 

In Eukarya, type IIA topoisomerases control topolog- 
ical stress generated by chromosome segregation, replica- 
tion and transcription. However in plants, type IIB topoiso- 
merases (close relatives of archaeal topoisomerase VI) con- 
trol endoreduplication, a polyploidization process respon- 
sible for the enlargement of plant cells, which in turn deter- 
mines plant size (18). 

The inconsistent phylogenomic pattern of topoiso- 
merases in the three domains of life is common to most pro- 
teins involved in DNA replication, recombination or repair 
(for the case of DNA polymerases, see (19)). Lateral gene 
transfers cannot account for the distribution of these en- 
zymes within the classical tree of life based on the ribosome. 
Instead, this pattern requires more elaborate evolutionary 



scenarios. We proposed previously that proteins involved in 
DNA metabolic activities (including topoisomerases) origi- 
nated first in the viral world and were later transferred 'ran- 
domly' to various cellular lineages (20). This scenario pre- 
dicts that some contemporary viruses or related mobile ele- 
ments (plasmids, transposons) still encode unique versions 
of DNA replication proteins, such as topoisomerases, that 
were never transferred to the 'cellular world' (except as in- 
tegrated mobile elements) (7). 

Here, we report the in silico discovery and preliminary 
biochemical characterization of a new DNA topoisomerase 
subfamily that supports the prediction outlined above. The 
genes encoding these proteins correspond to the fusion of 
the B and A subunits of type IIB topoisomerases. We iden- 
tified these genes in 19 bacterial genomes and in three plas- 
mids (two from bacteria and one from an archaeon). In 
bacterial genomes, these genes are located within integrated 
mobile elements related to conjugative plasmids. We puri- 
fied three members of this new subfamily of type IIB en- 
zymes (that we propose to call topoisomerase VIII): one 
from a mesophilic bacterium, one from a thermophilic bac- 
terium and one from a bacterial plasmid. We show that 
the enzyme encoded by the bacterial plasmid exhibits ATP- 
dependent relaxation and decatenation activities, the hall- 
mark of type II topoisomerases, whereas the other two 
enzymes, encoded by integrated elements, exhibit DNA 
cleavage activity, producing DNA double-stranded breaks, 
and/or ATP-independent relaxation activity. 

MATERIALS AND METHODS 

Plasmids and reagents 

Negatively supercoiled plasmid pBR322 DNA was pur- 
chased from Fermentas life science and kDNA (kineto- 
plast DNA) from Topogen. The reverse-gyrase enzyme was 
kindly given by Prof Marc Nadal. Primers for site-directed 
mutagenesis were synthesized by Sigma- Aldrich. 

Sequence database search 

The fusion of the topoisomerase VI protein subunits A 
and B was used as a query to carry out psi-BLAST and 
tBLASTn searches of different databases on the NCBI 
website (http://blast.ncbi.nlm.nih.gov/Blast.cgi). A multi- 
ple sequence alignment was obtained with Pipe Align (21) 
and manually corrected based on conserved motifs. Sec- 
ondary structures predictions were performed with PSI- 
PRED (22). 

Phylogenetic analysis 

For phylogenetic analysis, 27 identified homologs of topoi- 
somerases VIII were aligned with MUSCLE 3.8.31 (23) 
and manually inspected with Seaview 4.3.3 (24). The ahgn- 
ment was trimmed with BMGE software (25) and the 
BLOSUM30 matrix, resulting in 571 amino acid positions. 
ProtTest 3 (26) was used to determine the best substitu- 
tion model for further analysis with the Akaike Informa- 
tion Criterion (AICc) criteria. Maximum likelihood anal- 
ysis was preformed with RaxML 7.4.2 (27) and the com- 
bined tree-search/fast bootstrap method ('-f a') under the 
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Figure 1. Domain organization of type II topoisomerases of the A and B families. Homologous domains are colored similarly, with color intensities differing 
according to the extent of sequence similarity. Bf Bergerat fold (4) corresponding to the ATP-binding site with three conserved amino-acid signatures (small 
vertical bars), H2TH: Helix-2Turn-Helix domain, WHD: Winged Helix Domain; Y: catalytic tyrosine; triangle: catalytic site. The distributions of various 
type II topoisomerase subfamilies in the three domains of life, Archaea (A, a). Bacteria (B, b) and Eukarya (E, e) are indicated in brackets. 



PROTGAMMALG model (four discrete rate categories) 
with 1000 fast bootstraps. Bayesian analysis was performed 
with MrBayes 3.0 (28) and the mixed amino acid model 
with four discrete rate categories, which supported the 
WAG model with 100% posterior probability. Four Markov 
chains starting with a random tree were run simultaneously 
for 1 million generations, sampling trees at every 100th gen- 
eration. The first 2500 sampled trees (25%) were discarded 
as 'burn in'. 

Protein structure predictions 

Structural predictions were performed with the Phyre2 web- 
server (29). The N- and C-terminal halves of the three topoi- 
somerase VIII sequences (corresponding to the B and A 
topoisomerase VI subunits, respectively) were also modeled 
separately because conserved regions and motifs are scat- 
tered all along their alignment. The model was validated 
by SWISS-MODEL (30). Comparisons between different 
topoisomerases models were conducted with the PyMOL 
Molecular Graphics System (http://www.pymol.org). 

Analysis of the genetic context and DotPlot analysis 

The genomic context of identified topoisomerase VIII genes 
was analyzed with CLC Genomic Workbench as described 
previously (31). The topoisomerase VIII gene-containing 
region was considered to be potentially mobile when two 
criteria were satisfied: (i) the integrase-coding gene was 
present in proximity of the topoisomerase VIII gene; and 
(ii) the region containing both topoisomerase VIII- and 



integrase-coding genes was flanked by direct repeats (at- 
tachment [att] sites). Gepard (32) was used to generate dot- 
plots for the analysis of large mobile genomic regions. 

Recombinant protein expression and purification 

The gene encoding the topoisomerase VIII from Am- 
monifex degensii (gene ID: 646359367), Microscilla marina 
ATCC 23134 (gene ID: 640218028) and Paenibacillus 
polymyxa Ml (gene ID: 2518132008) were synthesized 
by GenScript@. The synthetic genes of A. degensii and 
M. marina were cloned into a pUC57 vector, and am- 
plified by PGR with primers containing Strep-Tag, Ndel 
and NotI restriction sites. The PGR products with the 
Strep-tags either at the 5' or 3' ends of the genes were 
cloned into a pET26b expression vector and the se- 
quences of recombinant topoisomerase VIII clones were 
confirmed by DNA sequencing. One of each of the recom- 
binant plasmids: MicNStrep.pET26b::topoisomerase, 
MicCStrep.pET26b::topoisomerase, AdegN- 
Strep.pET26b::topoisomerase and AdegC- 

Strep.pET26b::topoisomerase was used to transform 
Escherichia coli BL21 (DE3) strains (Novagen). The 
synthetic gene of P. polymyxa topoisomerase VIII was 
purchased directly with an N -terminal Strep-Tag. Bacteria 
were transformed with the various constructs and were 
grown subsequently in 1^ 1 of Luria Broth (LB) medium 
containing kanamycin (50 |jLg/ml), at 16°C overnight, after 
a heat shock at 42°C at the beginning of growth phase 
(ODsoonm of 0.1). Induction was carried out with 0.5 mM 
isopropyl-D-l-thiogalactopyranoside (IPTG) when cell 
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cultures reached an ODeoonm of 0.5. Cells were harvested 
by centrifugation, stored overnight at — 80°C, and then 
suspended in a Tris-HCl 40 mM pH 8.0, NaCl 200-1000 
mM, Dirhiotreitol (DTT) 1 mM, acide ethylene diamine 
tetraacetique (EDTA) 0.1 mM buffer containing protease 
inhibitors. Cell lysis was completed by sonication. The cell 
extracts were then centrifuged at 10 000 x g for 15 min 
at 4°C to remove cellular debris and aggregated proteins. 
Strep-tagged proteins from the soluble fraction were 
purified by gravity-flow chromatography on a Strep-Tactin 
column (IBA BioTAGnology) according to the manufac- 
turers recommendation and the eluted proteins were then 
run on a gel filtration column (Superdex™ 200 16/600, 
GE Healthcare) with an FPLC AKTA system. The column 
was equilibrated with buffer B (buffer A containing 10% 
(v/v) ethylene glycol or glycerol) and the protein was eluted 
in the same buffer. Topoisomerase VIII enzymes were 
detected by checking for the presence of a polypeptide with 
the expected size on an Sodium dodecyl sulfate (SDS)- 
polyacrylamide gel. Fractions containing topoisomerase 
VIII polypeptides were pooled and concentrated with 
Amicon 30 kDa cutoff concentrators (Millipore). Protein 
concentration was determined and the fractions were 
aliquoted and stored at — 80°C. 

In vitro mutagenesis 

Plasmids bearing P. polymyxa and M. marina topoi- 
somerase VIII mutant genes were generated with the 
QuikChange site-directed mutagenesis kit (Stratagene). Af- 
ter mutagenesis, plasmids were purified with Macherey 
Nagel minipreps kit and sequenced to ensure the absence 
of unwanted mutations. 

Positively supercoiled pBR322 preparation 

Positively supercoiled pBR322 was prepared by incubation 
of pBR322 plasmid for 30 min at 90°C in the presence of 
Sulfolobus solfataricus reverse-gyrase (RGl) in buffer con- 
taining 50 mM Tris-HCl [pH 8.0], 20 mM MgCh, 1 mM 
adenosine triphosphate (ATP) and 1 mM DTT (33). The re- 
action was stopped with 10 mM EDTA and 200 mM NaCl 
and the positively supercoiled plasmid was purified with the 
Macherey-Magel NucleoSpin Gel and PCR Clean-up kit. 

DNA relaxation and decatenation assays 

Relaxation assays (per 20 ixl) were performed with nega- 
tively or positively supercoiled pBR322 plasmid or kDNA 
(200 ng) and the indicated amount of enzyme in buffer 
containing 50 mM Tris [pH 8.0], 2.5 mM MgCh, 0.1 mM 
EDTA and ATP (as indicated in the figure legends) for 40 
min at 20-30°C or 10 min at 70°C depending on the topoi- 
somerase host. The reaction also contained a final concen- 
tration of 0.30 M NaCl and ^^19% glycerol from the pro- 
tein storage buffer. In some cases at the end of the reaction, 
an additional incubation of 30 min at 55°C, was done after 
addition of 2 |xl sodium dodecyl sulfate (SDS) 10% and 2 
|xl proteinase K (1 mg/ml) to check for stabilized cleavable 
complexes. 

Reactions were stopped with 2 |jl1 1(10% SDS):1(30% 
glycerol) stop buffer. The reactions were loaded on a 1% 



agarose gel in 0.5 x Tris-Borate-EDTA (TBE) buffer (45 
mM Tris-borate [pH 8.3], 1 mM EDTA). The gels were run 
at 50 V/cm for 4 h, stained with ethidium bromide (EtBr) 
and visualized with an imaging system. 

RESULTS 

Identification of DNA topoisomerase VIII, a new subfamily 
of type IIB topoisomerase 

We routinely screen the NCBI non-redundant protein and 
environmental sequence databases by psi-BLAST (34) to 
search for new type IIB topoisomerase-encoding genes. 
During the course of this work, we used the B subunit of 
topoisomerase VI from particular archaea (e.g. Sulfolobus 
shibatae) as a query. This led to the detection of a diver- 
gent version of type IIB topoisomerases in which the Berg- 
erat fold (ATP-binding site) and the Toprim domain were 
present in the same polypeptide. We used these unusual pro- 
teins as queries to carry out additional Psi-Blast searches. 
This retrieved a set of closely related proteins of similar sizes 
corresponding to the topoisomerase VI subunits B and A 
fused, in that order, into a single polypeptide (Figure 1). Af- 
ter two iterations, we found that genes encoding these atypi- 
cal type IIB topoisomerases are present in 1 1 complete and 
8 partial bacterial genomes (Supplementary Table SI). In 
addition, we found two genes encoding these proteins on 
plasmids (Supplementary Table SI): one in the halophilic 
archaeon Halalkalicoccus jeotgali B3 (plasmid 5) and the 
other in the firmicute P. polymyxa Ml (pPPMla). We also 
identified a very similar protein encoded by two genes (sep- 
arated by 41 bp) located on a plasmid in Paenibacillus alvei 
DSM 29 (pPAV109): one gene encodes a protein homolo- 
gous to the N -terminal moiety of the topoisomerase VI B 
subunit, whereas the other encodes a protein homologous 
to the C-terminal moiety of the B subunit fused to a do- 
main homologous to the topoisomerase VI subunit A (Fig- 
ure 1). Moreover, we also detected small DNA fragments 
encoding these proteins in several whole genome shotgun 
and environmental sequence databases (not shown). 

These atypical type IIB topoisomerases, with sizes be- 
tween 695 and 882 amino acids, are annotated either as 
topoisomerase VI, hypothetical proteins, ATP-binding pro- 
teins, Toprim or topoisomerase (ATP hydrolyzing). They 
harbor the four conserved regions important for type II 
topoisomerase function, i.e. the Bergerat fold (ATP-binding 
site), the transducer domain, the WHD and the Toprim do- 
mains, as well as the small H2TH domain specific of the 
type IIB family (Figures 1 and 2). The putative active site 
tyrosine was adjacent to either another tyrosine (YY) or 
a phenylalanine (FY), with the exception of the enzymes 
from Roseobacter sp. AzwK-3b and Rhizobium gallicum in 
which the tyrosine is preceded by a methionine (MY). This 
can be considered as a hallmark of the type IIB topoiso- 
merase family, because topoisomerase VI and Spoil pro- 
teins always have YY or FY in their active sites. In con- 
trast, this feature is usually not observed in the type IIA 
family. Moreover, the identified proteins displayed only a 
remote relationship with topoisomerase VI, and their simi- 
larity was restricted to specific amino-acid signatures within 
the conserved motifs (Figure 2 and Supplementary Figure 
SI). Overall, the identified group of enzymes is not more 
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Figure 2. Conserved amino-acid regions shared between topoisomerases VI and VIII enzymes. In red, amino acids sliared between the two families; in bhie, 
amino acids specific for the topoisomerase VIII sub-family; in green, amino acids specific for the topoisomerase VI sub-family. Underlined are alternative 
amino acids rarely found in otherwise strictly conserved motifs. Adeg.- Ammonifex degensii topoisomerase VIII, Teth: Thermoanaerohacter ethanolicus 
topoisomerase VIII, Mmar.- Microscilla marina topoisomerase VIII; pPol: Paenihacillm polymyxa Ml plasmid pPPMla topoisomerase VIII; Sshi: Sul- 
folohus shihatae topoisomerase VI; Mmaz: Methanosarcina mazei topoisomerase VI; Anae: Anaeromyxohacter sp. topoisomerase VI; Atha: Arabidopsis 
thaliana topoisomerase VI. Adeg, Teth, Mmar, Anae and P. polymyxa are Bacteria, Sshi and Mmaz are Archaea, Atha is a eukaryote. 



closely related to bona fide bacterial topoisomerase VI than 
it is to archaeal or eukaryal topoisomerase VI. In contrast, 
these atypical type IIB enzymes exhibit extensive sequence 
similarity with each other, throughout their entire sequence. 
Furthermore, these proteins share a specific amino-acid sig- 
nature, 'R V/I E L N A/S M', in their C-terminal region 
(Supplementary Figure SI), which is not found in topoiso- 
merases from the currently established families and subfam- 
ilies. 

All these observations strongly suggest the ancient diver- 
gence of these enzymes from topoisomerase VI, as opposed 
to recent independent fusions of topoisomerase VI subunits 
A and B in various bacterial lineages. Based on these obser- 
vations as well as the data presented below, we propose that 
these proteins should be considered as a distinct subfam- 
ily of type IIB enzymes named DNA topoisomerase VIII. 
This classification is consistent with the historical nomen- 



clature of topoisomerases in which subfamilies of type I en- 
zymes have been systematically given odd numbers (topoi- 
somerases I, III and V) and type II enzymes even numbers 
(topoisomerases II, IV, VI and VIII) (3). 

Topoisomerase VIII is present in three bacterial phyla: 
Firmicutes (Clostridia and Bacilli), Bacteroidetes (Sphin- 
gobacteria) and Proteobacteria (alpha and gamma). No- 
tably, those present in evolutionarily close genera are 
grouped together in phylogenetic analysis, whereas topoi- 
somerase VIII enzymes from various genera of Proteobac- 
teria and Firmicutes sometimes display mixed phylogenetic 
patterns (Figure 3). The three topoisomerase VIII enzymes 
encoded by plasmids are grouped with cellular topoiso- 
merase VIII enzymes from species that are closely related 
to the hosts of these plasmids. For example, topoisomerase 
VIII enzymes encoded by pPPMla and pPAV109 from P. 
polymyxa and P. alvei, respectively, are grouped with the 
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Figure 3. Phytogeny of topoisomerase VIII. Bayesian pliylogeny of tlie 24 topoisomerase VIII homologs listed in Supplementary Table SI and two topoi- 
somerase VIII enzymes detected in metagenomes (571 amino acid positions). The tree was calculated with MrBayes (MIX model + gamma4). The scale 
bar shows the average number of substitutions per site. Values at nodes are posterior probabilities and bootstrap values calculated with the rapid bootstrap 
feature of RAxML (LG + gamma4) from heuristic searches of 1000 resampled datasets, when the same node was recovered. Topoisomerase VIII enzymes 
encoded by a plasmid are marked by circles. 



chromosomally-encoded topoisomerase VIII from Paeni- 
bacillus panacisoli. Similarly, the topoisomerase VIII en- 
coded by the plasmid from a haloarchaeon clusters with a 
sequence present in a metagenome from a hypersaline lake 
(Figure 3). These observations are consistent with previ- 
ous studies that have shown that mobile elements often co- 
evolve with their hosts (35,36). 

Most bacteria carry two type IIA topoisomerases, DNA 
gyrase and DNA topoisomerase IV, which have specialized 
functions (3). However, there are cases (e.g. Thermotoga 
maritima and Mycobacterium tuberculosis) where one dual- 
function enzyme is sufficient (3). Therefore, we considered 
the possibility that topoisomerase VIII may compensate for 
the absence of topoisomerase IV in some species. However, 
we found no correlation between the presence of topoiso- 
merase VIII and the presence/absence of topoisomerase IV 
in any of the species analyzed (Figure 3). 

Topoisomerase VIII genes present in bacterial genomes are 
located within integrated conjugative plasmids 

The presence of topoisomerase Vlll-encoding genes on 
three different plasmids and the sporadic distribution of 
these genes in taxonomically distant bacteria suggested that 
the presumably 'cellular' (i.e. encoded on bacterial chro- 
mosomes) gene copies are, in fact, be carried by integrated 



mobile genetic elements (37). Indeed, our previous analysis 
showed that integrated mobile elements often carry genes 
encoding proteins involved in DNA metaboHsm (35,38). 
We performed a detailed analysis of the genetic context 
of the 'cellular' topoisomerase Vlll-encoding genes to ver- 
ify this hypothesis. Unfortunately, 12 of the 19 topoiso- 
merase Vlll-encoding bacterial genomes were only avail- 
able as WGS (whole genome shotgun) libraries consisting 
of genomic contigs of variable lengths, complicating their 
analysis. Nonetheless, we were able to obtain evidence sug- 
gesting that at least 10 of the seemingly cellular topoiso- 
merase Vlll-encoding genes are encoded by mobile ele- 
ments (Supplementary Table SI). 

In several cases, the mobile elements could be detected via 
the identification of their integration sites. During recombi- 
nation mediated by the element-encoded integrase, the tar- 
get sequence (attachment site, attB) is duplicated, and flanks 
the element from both ends as direct repeats (attL and attR). 
One of the repeats is typically found next to a gene encoding 
the recombinase. We identified the precise targets of inte- 
gration for seven of the elements. Two elements, AmmDeg- 
El (in A. degensii KC4) and RosAzw-El (in Roseobacter 
sp. AzwK-3b), are integrated in intergenic regions, whereas 
PaePan-El (in R panacisoli \:>SM2UA5) and SinFre-El (in 
Sinorhizoblum fredii CCBAU 45436) recombined with 3'- 
distal regions of protein-coding genes (Supplementary Ta- 
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ble SI). In addition, elements AgrTum-El (in Agrohac- 
terium tumefaciens WRT31), DesKuz-El (in Desulfotomac- 
ulum kuznetsovii DSM 6115) and RhiGal-El (inR. gallicum 
bv. gallicum R602sp) are integrated into tRNA-Thr, tRNA- 
Pro and tRNA-Cys genes, respectively (Figure 4). 

A search for putative attachment sites within topoiso- 
merase Vlll-encoding genomes did not result in the iden- 
tification of additional integrated elements. This may be be- 
cause some genomic contigs containing the integrated ele- 
ment were incomplete, the elements used another mecha- 
nism for integration, or the elements were in an advanced 
stage of decay, which would render the direct repeats un- 
recognizable due to mutations. Thus, we used an alterna- 
tive approach involving the comparison of topoisomerase 
Vlll-encoding and topoisomerase Vlll-free closely related 
bacterial genomes. The integrated elements were expected 
to disrupt the collinearity of the corresponding genomic 
loci. DotPlot analysis showed that the genomic regions 
encoding topoisomerase VIII in Desulfosporosinus meri- 
diei DSM 13257, Desulfitobacterium hafniense DCB-2 and 
D. hafniense Y51 are probably mobile. Notably, the ele- 
ments in two D. hafniense strains are located in distinct 
genomic loci (Supplementary Figure S2). In addition, the 
topoisomerase Vlll-encoding contig of D. hafniense DP7 is 
collinear throughout its length with the corresponding mo- 
bile region in D. hafniense DCB-2 and D. hafniense Y51. 
This suggests that this contig contains a mobile element 
which is related to the two other D. hafniense elements, De- 
sHaf_DCB-2 and DesHaLYSl. 

The predicted integrated elements did not contain any vi- 
ral signature genes (e.g. for virion components or genome 
packaging), suggesting that they are unlikely to be of vi- 
ral origin. In contrast, many of the genes have homologs 
in bacterial plasmids (Figure 4). For example, 27 of the 42 
open reading frames present in AmmDeg-El had plasmid 
homologs, indicating that the topoisomerase Vlll-encoding 
elements are derived from integrative plasmids. We focused 
on elements for which exact boarders could be unequivo- 
cally defined to analyze genetic content in more detail (Fig- 
ure 4). Elements PaePan-El, AmmDeg-El and DesKuz-El 
and plasmids pPPMla and pPAV109 encode both proteins 
typical of conjugative plasmids and integrative and con- 
jugative elements (ICE) (31); e.g. all five elements encode 
TraG-like conjugal transfer coupling protein homologs. In 
addition, AmmDeg-El and DesKuz-El share homologs of 
ParM, a protein responsible for plasmid segregation upon 
cell division. Both elements also have several copies of 
toxin-antitoxin modules (RelE/B-like in AmmDeg-El and 
DesKuz-El, HicA/HicB in AgrTum-El and MazE/F-like 
in AmmDeg-El; Figure 4), which often ensure the stable 
maintenance of plasmids and ICEs (39). Several other pro- 
teins involved in DNA metabolism besides topoisomerase 
VIII are encoded within the predicted elements. AmmDeg- 
El and DesKuz-El encode homologs of the replication pro- 
tein RepE of plasmid F, supporting further a link between 
the two integrated elements and conjugative plasmids. In- 
terestingly, AmmDeg-El carries next to the RepE-like gene 
a homolog of the bifunctional primase -polymerase, which 
is typically found in various bacterial and archaeal mo- 
bile elements (40^2), whereas DesKuz-El contains at the 
same position a gene corresponding to a unique fusion of 



DnaG-like primase and DnaB-like helicase (Figure 4). The 
two smaller elements, RosAzw-El and RhiGal-El, encode 
a homolog of family I DNA polymerase and a DNA ligase, 
respectively. Neither RosAzw-El nor RhiGal-El contains 
identifiable genes for components of the conjugal appara- 
tus. 

The elements harboring topoisomerase Vlll-encoding 
genes appear to be at different stages of deterioration. At 
one end of the spectrum are the three plasmids, which prob- 
ably rely on the encoded DNA metabolism proteins, includ- 
ing topoisomerase VIII, for their efficient replication. How- 
ever, some of the integrated elements appear to be in the 
process of being inactivated. Analysis of the AmmDeg-El 
revealed that a number of genes, including those encoding 
superfamily II helicase, DNA methyltransferase, endonu- 
clease, and two transposases, have accumulated mutations 
resulting in premature stop codons (Figure 4). However, we 
could identify the attachment sites (perfect direct repeats) 
flanking this element, suggesting that the insertion of this 
element into the bacterial chromosome was probably a re- 
cent event. At the opposite side of the spectrum are the el- 
ements for which the borders could not be defined, as in 
the case of M. marina ATCC 23134 (Supplementary Table 
SI). Notably, transposase may be involved in the process of 
element inactivation because topoisomerase Vlll-encoding 
genes in Dehalobacter species CP, DCA and UNSWDHB 
are located next to truncated transposase genes. 

Structural analysis of topoisomerases VIII 

We performed structural modeling of the three topoi- 
somerase VIII enzymes chosen for biochemical analyses 
(see below), including those from A. degensii KC4 (Adeg, 
882 aa), M. marina ATCC 23134 (Mmar, 783 aa) and R 
polymyxa Ml plasmid pPPMla (pPpol, 751 aa). Structures 
of topoisomerase VI from S. shibatae (pdb code: 2ZBK; 
(10)) and Methanosarcina mazei{pdh code: 2Q2E; (9)) were 
used as templates. Despite their low sequence similarities, 
the three full-length topoisomerase VIII enzymes have the 
same modular organization as topoisomerase VI and con- 
tain five domains. Topoisomerase VIII contains the N- 
terminal Bergerat fold, followed by the S13-like H2TH and 
the ribosomal S5 domains 2-like f'old that is present in the 
transducer domain of topoisomerase VI. This combination 
of domains is specific to the subunit B of type IIB topoi- 
somerase. Topoisomerase VIII enzymes also possess a C- 
terminal WHD and Toprim domains; the organization of 
the Toprim domain is specific to type IIB topoisomerase 
subunit A (in type IIA enzymes, Toprim follows the trans- 
ducer domain). 

The ATP-lid in the ATP -binding site of topoisomerase 
VIII is highly conserved (Supplementary Figure S3). The 
switch lysine, which interacts with the 7 -phosphate of the 
bound nucleotide is at position 446 in A. degensii, 374 in M. 
marinus and 369 in P. polymyxa topoisomerase VIII (Sup- 
plementary Figure S3). This lysine is conserved in all type 
IIA enzymes and topoisomerase VI transducer domains 
(K427 in S. shibatae topoisomerase VI). In the modeled 
structures, this switch lysine points away from the active site 
because the template used corresponds to the relaxed state 
conformation (9). 
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In regions corresponding to topoisomerase VI subunit B, 
the major difference between the two type IIB subfamilies is 
the length of the transducer domain, which is clearly smaller 
in topoisomerase VIII. The end of the last helix stops at 
nearly the middle of all-helix of the S. shibatae topoiso- 
merase VI structure (see alignment in Supplementary Fig- 
ure SI and Table S2). This corresponds to residue K450 
which was identified as a hinge residue involved in helix 
bending that is crucial for conformational changes during 
the catalytic cycle (opening-closing of the N-gate) (10). The 
WHD domain of DNA topoisomerase VIII, a three -helix 
bundle, is smaller than the equivalent domain of topoiso- 
merase VI (80 residues instead of 160: Supplementary Table 
S2). The first helix of this domain acts also as a junction be- 
tween the transducer and the WHD domains (which corre- 
sponds to the junction between the B and A subunits). The 
structure of the Toprim domain is highly conserved and all 
the residues imphcated in magnesium binding (E209, D261 
and D263 in the S. shibatae structure) are spatially con- 
served. 

Based on the structural similarities of the individual do- 
mains, we suggest that the architecture of the topoisomerase 
VIII homodimer is comparable to that of the topoisomerase 



VI heterotetramer. However, the C-terminal helix of the 
transducer and the N-terminal helix of the WHD domain 
are lacking in topoisomerase VIII; therefore, it is difficult 
to predict how the short junction characteristic of these en- 
zymes organizes the orientation between these two domains 
(compare Figure 5 and Supplementary Figure S4). In addi- 
tion, the |3-turn-p in the topoisomerase VI Toprim domain 
responsible for the interactions between the two A subunits 
and the formation of a continuous p-sheet is not present in 
topoisomerase VIII. Therefore, it is difficult to propose a 
model for the full-length homodimer. 

Topoisomerase VIII enzymatic activities 

Impairment of the excision of integrated elements occasion- 
ally leads to permanent fixation of new functions within 
cellular lineages, which may even replace ancestral cellu- 
lar counterparts. This route has been proposed for the ori- 
gin of a novel nuclear protein in Dinoflagellates (43) and 
bacteriophage T3/T7-like RNA and DNA polymerases in 
mitochondria (44). With this in mind, we set out to verify 
experimentally whether plasmid-borne topoisomerase VIII 
possesses its predicted enzymatic activity and whether ho- 
mologs encoded by (decaying) integrated elements retain 
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Figure 5. Ribbon representation of the crystal structure of Sulfolohus shihatae topoisomerases VI B (top) and A (bottom) subunits (Sshi; pdb code: 2ZBK) 
and molecular models of the N- (top) and C-terminal (bottom) domains of topoisomerase VIII from Aminonifex degensii KC4 (Adeg), Microscilla marina 
ATCC 23134 (Mmar) and Paenibacillus polymyxa Ml plasmid pPPMla (pPpol). The Bergerat fold domain is shown in yellow, the H2TH in orange, the 
transducer in brown, the WHD in blue, the Toprim domain in red and the topoisomerases VIII specific C-terminal domain of the A subimit in pale green. 



these activities. For this purpose, we selected and purilied 
three proteins encoded by elements in different stages of de- 
cay (see above): (i) we selected topoisomerase VIII from P. 
polymyxa Ml plasmid pPPMla as a functional mobile el- 
ement (45), (ii) the protein from AmmDeg-El of the ther- 
mophile A. degensii KC4 (46) as a case of recent integration, 
and (iii) topoisomerase VIII from the mesophile M. marina 
ATCC 23 1 34 (47) as a representative of ancient integration. 

We produced the proteins in E. coli strains carrying the 
synthetic topoisomerase Vlll-encoding genes cloned into 
pET26-b vectors (see 'Materials and Methods' section). 
Most proteins were not soluble in classical expression con- 
ditions. However, soluble proteins were obtained in all cases 
by growing cells in early exponential phase at 42°C un- 
til OD 0.4-0.5 to induce endogenous chaperones, followed 
by overnight IPTG induction at 16°C. The topoisomerase 
VIII enzymes were purified by affinity chromatography and 
gel filtration (see 'Materials and Methods' section). The 
purity of the topoisomerase VIII preparations was similar 
to that of the purified S. shibatae DNA topoisomerase VI 
(Supplementary Figure S5). The three enzymes eluted as 
dimers during gel filtration (Supplementary Figure S6 for 



P. polymyxa topoisomerase VIII), indicating that these en- 
zymes are homodimers, consistent with structural models. 

We used a negatively supercoiled pBR322 plasmid DNA 
as a substrate and reaction mixtures containing magnesium 
with or without ATP to assay DNA relaxation activity (see 
'Materials and Methods' section). We detected weak DNA 
relaxation activities with the enzymes of M. marina and 
P. polymyxa (Figure 6A and B). In contrast, the topoiso- 
merase VIII of A. degensii only exhibited DNA cleavage ac- 
tivity in the presence of SDS and proteinase K, producing 
full-length linear plasmids, probably corresponding to the 
formation of a cleavable complex (Supplementary Figure 
S7, lanes 2-^). All these activities were magnesium depen- 
dent. Surprisingly, the relaxation activity of the M. marina 
enzyme was ATP independent (Supplementary Figure S7, 
compare lanes 2 and 3). However, the addition of 1 mM 
ATP inhibited slightly both the relaxation activity of the M. 
marina enzyme and the DNA cleavage activity of the A. de- 
gensii enzyme (Supplementary Figure S7, lanes 4). The re- 
laxation activity of the M. marina protein was probably not 
due to a contamination, because it was abolished when the 
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Figure 6. Relaxation and decatenation assays. (A) Relaxation assays of wild-type and double mutants of Micro.sciUa marina (Mmar) topoisomerases VIII. 
Reactions were carried out without ATP and 200 ng of pBR322 was used as the substrate. Lane 1: control with no enzyme; lanes 2-7, wild-type (WT) 
with 1, 1.5, 2, 3, 6 and 12 pmol of enzyme, respectively; lanes 8-13, YY-FF double mutants with 0.5, 0.75, 1, 1.75, 3 and 6 pmol of enzyme, respectively. 
(B) Paenihacillus polymyxa (pPol) topoisomerase VIII relaxation assays with the wild-type enzyme; lanes 1-3, without ATP and with 10, 5, 2.5 pmol of 
enzyme, respectively, and lanes 4—6, with ImM ATP and 10, 5, 2.5 pmol of enzymes, respectively; and with the D61A mutant lanes 7-9, without ATP, with 
10, 5, and 2.5 pmol of enzyme, respectively, and lanes 10-12 with 1 mM ATP and 10, 5 and 2.5 pmol of enzyme, respectively. Supercoiled and relaxed (or 
open circular) topoisomers are noted as SC and R/OC, respectively. (C) PaenibaciUiis polymyxa topoisomerase VIII decatenation assays, with or without 
ATP; lane 1: no enzyme, lanes 2-A with 0, 1 and 2 mM ATP, respectively and 20 pmol of enzyme. kDNA: kinetoplastid DNA. 



two tyrosines of the active site were replaced with pheny- 
lalanines in a recombinant protein (Figure 6A). 

Notably, the relaxation activity of the P. polymyxa topoi- 
somerase VIII was lower than that of M. marina but ATP 
dependent, as expected of a type II DNA topoisomerase 
(Figure 6B and Supplementary Figures S8a, lanes 3, S8b, 
lanes 4 and 7). Substitution of the conserved aspartate in 
the P. polymyxa topoisomerase VIII Bergerat fold (D61), 
which is essential for ATP binding (D73 in E. coli DNA gy- 
rase) abolished the ATP-dependent relaxation activity (Fig- 
ure 6B). Treatment of the reaction products with proteinase 
K and SDS only produced linear DNA in the presence of 
ATP, suggesting the formation of a cleavable complex (Sup- 
plementary Figure S8, lane 6). Notably, we showed previ- 
ously that the formation of a cleavable complex is ATP- 
dependent for S. sliibatae DNA topoisomerase VI, whereas 
it is ATP independent for type IIA enzymes (48). 

Some type II topoisomerases relax positively supercoiled 
DNA more efficiently than negatively supercoiled DNA 
(49). We thus tested the activity of the P. polymyxa DN A 
topoisomerase VIII on a positively supercoiled plasmid. 
This enzyme also exhibited low relaxation activity on pos- 
itively supercoiled DNA (Supplementary Figure S8b). Fi- 
nally, decatenation assays with kDNA as a substrate for 



P. polymyxa DNA topoisomerase VIII showed that this 
enzyme had a very weak decatenation activity, which was 
slightly stimulated by ATP (Figure 8C). The poor stability 
of the proteins precluded further enzymatic characteriza- 
tion. 



DISCUSSION 

We have identified a new group of type IIB topoisomerases 
that we call DNA topoisomerase VIII, which are encoded 
by either free or integrated plasmids. We partially character- 
ized three proteins of this new subfamily. One of them ex- 
hibited only endonuclease activity, and the other two exhib- 
ited relaxation activity. In all cases, the relaxation activities 
detected with purified preparations of recombinant proteins 
were weak (complete relaxation was never observed). Fur- 
thermore, only the topoisomerase VIII encoded by the plas- 
mid of P. polymyxa exhibited the expected ATP-dependent 
relaxation activity of type II topoisomerases. It is possi- 
ble that the topoisomerase VIII of A. degensii and those 
of M. marina do not exhibit this activity because they are 
located on decaying elements and are no longer fully func- 
tional. The enzyme from P. polymyxa may exhibit the ex- 
pected ATP-dependent relaxation activity because its activ- 
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ity is still required for the maintenance of this large plasmid 
(366 576 bp). The complete absence of relaxation activity in 
the case of the A. degensii enzyme is somewhat surprising 
because our in silico analysis suggests that the insertion of 
this gene into the bacterial chromosome is relatively recent. 
This indicates that the activity of enzymes encoded by mo- 
bile elements can be rapidly lost after integration; therefore, 
caution should be exercised when studying the biochemi- 
cal properties of 'cellular enzymes' obtained by expressing 
genes present on cellular chromosome if the origin of these 
enzymes has not been carefully investigated. 

We found that the structures of the Bergerat fold do- 
mains of the three topoisomerase VIII enzymes were rather 
variable (Figure 5). The rmsd values varied from 1.5 A 
(A. degensii DNA topoisomerase Wlll/S. shibatae topoi- 
somerase VI) to 2.9 A {A. degensii topoisomerase VIII/ 
M. marina topoisomerase VIII and M. marina topoiso- 
merase VIII/5. shibatae topoisomerase VI). This variabil- 
ity may explain differences in the activities of the three pro- 
teins. However, the core fold is highly conserved (especially 
the ATP-binding pocket, see Supplementary Figure S3). 
The main difference involves an insertion in the A. degen- 
sii topoisomerase VIII and S. shibatae topoisomerase VI 
in the C-terminal end of the Bergerat fold domain that is 
not present in M. marina and P. polymyxa topoisomerase 
VIII. At present, we cannot make substantiated conclusions 
about the relationship between ATP-dependency and struc- 
tural differences among topoisomerase VIII enzymes from 
the available information, especially given that two highly 
conserved enzymes, the DNA gyrases of M. tuberculosis 
and Mycobacterium leprae, have different activity spectra 
(50,51)] 

The ATP-independent relaxation activity of the M. ma- 
rina topoisomerase VIII is reminiscent of that of bacte- 
rial DNA gyrase (52,53). Bates et al. suggested that the 
ancestral role of ATP in type II enzyme reactions was to 
prevent DNA double-strand breaks (DSBs) by controlling 
the separation of protein-protein interfaces (54). They sug- 
gested that the intersubunit interface is weaker in DNA 
gyrase than in other type II enzymes, explaining its ATP- 
independent relaxation activity (54). It is interesting that the 
three different forms of topoisomerase VIII studied here ex- 
hibited low ATP-stimulated relaxation, ATP-independent 
relaxation or produced DSBs. This suggests that the differ- 
ences between these three enzymes results from subtle mod- 
ifications of the protein-protein dimer interface. 

The low activity of the P. polymyxa topoisomerase VIII 
in our in vitro assays may be due to the absence of po- 
tentially important accessory proteins required for optimal 
activity in vivo. For example, plant topoisomerase VI en- 
zymes require two additional proteins. Midget and RLHl, 
for their activity, at least in vivo (55). Moreover, the cell divi- 
sion protein MukB stimulates the activity of £. co// topoiso- 
merase IV in vitro (56). Other topoisomerases require post- 
translational modifications to achieve full catalytic strength 
(57). If additional proteins or processing enzymes are in- 
deed required for the full activity of topoisomerase VIII, 
these accessory proteins are likely to be encoded by the plas- 
mids bearing the topoisomerase VIII genes. 



Origin and evolution of DNA topoisomerase VIII 

Several families and subfamilies of DNA topoisomerases 
are widespread in the living world. Some are almost uni- 
versal (such as topoisomerase lA) or present in all mem- 
bers of one or two cellular domains (such as DNA gyrases 
in Bacteria or type IIA topoisomerases in Bacteria and Eu- 
karya) (7). In contrast, topoisomerase VIII enzymes appear 
to be rare, because they are encoded only in about 0.5% of 
currently available bacterial genomes and are not present in 
archaeal or eukaryotic genomes. Moreover, topoisomerase 
VIII is only present in a handful of representatives of three 
bacterial phyla. Furthermore, two of them, Proteobacteria 
and Firmicutes, correspond to the two phyla with by far 
the highest number of completely sequenced genomes. This 
very narrow distribution of topoisomerase VIII in the liv- 
ing world resembles the case of topoisomerase V, a unique 
member of type I topoisomerases (family C) that is only 
present in the archaeon Methanopyrus handler i (58,59). It 
also resembles atypical topoisomerases found in the vi- 
ral world, such as the heterotrimeric type IIA enzymes 
encoded by T4-like bacteriophages and the homodimeric 
type IIA enzymes encoded by some Megavirales (7). Strik- 
ingly, topoisomerase VIII enzymes are encoded by plasmids 
(two from bacteria and one from archaea), and in bacte- 
rial genomes, they are located within integrated mobile ele- 
ments. This is therefore another example of a topoisomerase 
with an unusual phylogenomic distribution and complex 
evolutionary trajectory. 

The fusion of two bacterial topoisomerase VI subunits in 
one bacterial phylum, followed by sporadic spread to other 
lineages by lateral gene transfer is one hypothesis that ex- 
plains the rare and scattered distribution of topoisomerase 
VIII in Bacteria. However, this hypothesis seems unlikely 
considering the high divergence of primary sequence be- 
tween topoisomerase VI and VIII. Notably, bona fide topoi- 
somerase VI enzymes present in bacteria cannot be dis- 
tinguished from their archaeal homologs and branch with 
archaeal DNA topoisomerase VI enzymes in phylogenetic 
analyses (38); in contrast, topoisomerase VI and VIII en- 
zymes are so divergent that their amino-acid sequences can- 
not be reliably aligned for phylogenetic analyses. It is diffi- 
cult to explain why the fusion protein of the two topoiso- 
merase Vl-like subunits (i.e. the ancestor of topoisomerase 
VIII) would have diverged so rapidly in one particular bac- 
terial lineage but remained conserved during its dispersion 
in various bacterial lineages. 

The divergence between topoisomerases VI and VIII en- 
zymes suggests that the gene duplication at the origin of 
these two protein subfamilies occurred before the emer- 
gence of Archaea and Bacteria (Figure 7). In agreement 
with this view, topoisomerase VI was probably present in 
the last archaeal common ancestor (LACA) because this 
enzyme is now present in all archaea, with the exception 
of Thermoplasmatales (3). In contrast, the rarity of topoi- 
somerase VIII in bacteria suggests that this protein was 
not present in the last bacterial common ancestor (LBCA), 
but was already encoded by plasmids at that time. Modern 
topoisomerase VIII thus probably originated before the di- 
versification of Bacteria, by a fusion between the A and B 
subunits, following the duplication that initiated the diver- 



Nucleic Acids Research, 2014, Vol. 42. No. 13 8589 




Figure 7. Proposed scenario for tlie evolution of tlie type IIB topoiso- 
merase family. LBCA (last bacterial common ancestor), LAC A (last ar- 
chaeal common ancestor). Dotted double arrows indicate co-evolution of 
cellular and plasmidic lineages, with subsequent integration or loss of plas- 
midic topoisomerase Vlll-encoding genes in cellular genomes. Black dot- 
ted arrow and question mark, possible transfer of a bacterial plasmid to the 
archaeal domain. For simplicity, both the presence of a few topoisomerase 
VI enzymes in Bacteria (transferred from Archaea) and the secondary split 
of the topoisomerase VIII gene in the plasmid PAV109 of Paenibacillus 
alvei are not indicated. 



gence between topoisomerase VI and VIII enzymes (Fig- 
ure 7). In this scenario, the sporadic distribution of topoi- 
somerase VIII in the bacterial domain is easily explained 
by the co-evolution of bacterial plasmids bearing topoiso- 
merase Vlll-encoding genes with their hosts and the spo- 
radic integration of these genes into bacterial genomes (37). 

It has been suggested that DNA topoisomerases first 
originated and diverged into various families and subfam- 
ilies in the viral world (7). Viruses and plasmids are evo- 
lutionarily related and can be considered as the same se- 
quence space within a greater viral world (60-62). Accord- 
ingly, it is possible that type IIB topoisomerases first origi- 
nated and subsequently diversified into two subfamilies (VI 
and VIII) in this plasmid/viral world. Later on, topoiso- 
merase VI were transferred to the cellular members of the 
branch of the tree of life leading to LACA, whereas plas- 
mids encoding topoisomerase VIII co-evolved with ances- 
tors of the bacterial domain (Figure 7). The presence of 
the topoisomerase Vlll-encoding gene in the plasmid of a 
halophilic archaeon can also be explained by the presence 
of these plasmids in ancient archaeal organisms (LACA and 
close relatives). However, in this case, it is also possible that 
these plasmids were transferred from Bacteria to Archaea 
at the onset of the haloarchaeal lineage, which has experi- 
enced the introduction of around 1000 bacterial genes (63). 
Irrespective of the evolutionary scenario, the discovery of a 
subfamily of topoisomerases specifically encoded by plas- 
mids confirms that mobile elements are potential reservoirs 
of novel proteins involved in DNA metabolism (7,36,42,64). 



Possible physiological role of topoisomerase VIII 

The role of topoisomerase VIII in plasmid physiology re- 
mains to be established. Several conjugative plasmids from 
Proteobacteria and Firmicutes encode enzymes related to 
bacterial topoisomerase III (a subfamily of type lA en- 
zymes) (65). These plasmidic topoisomerase III enzymes are 
used either as decatenases to help the faithful segregation 
of these large plasmids with low copy number, or as swivels 
during conjugation to facilitate the unwinding of the donor 
strand that is transferred to the recipient host (65). Alterna- 
tively, they may also be involved in the resolution of hemi- 
catenane structures during plasmid replication or recombi- 
nation (66). Notably, the plasmids pPPMla oi P. polymyxa 
and pAV109 of P. a/ve; encode a DN A topoisomerase III, in 
addition to a topoisomerase VIII. Topoisomerase III may 
be involved in the resolution of hemicatenane structures, 
whereas topoisomerase VIII enzymes could be the decate- 
nase and/or swivelase involved in the segregation of these 
large plasmids (>100 kb). 

Interestingly, the genome of P. polymyxa contains nine 
loci dedicated to the synthesis of non-ribosomal peptides, 
one of which is located on the plasmid pPPMla (67). 
Of note, some plasmids encode proteins containing pen- 
tapeptide repeats that interact with cellular topoisomerases, 
as demonstrated by fluoroquinolone resistance proteins 
(68). These peptides may serve as important antibiotics 
in the control of plant pathogens. We can speculate that 
some of the peptides encoded by the plasmid pPPMla are 
also topoisomerase inhibitors. In that case, the plasmid- 
encoded topoisomerase III and topoisomerase VIII may 
also be resistant to these antibiotics, and serve as substi- 
tutes for antibiotic-sensitive cellular topoisomerases. Thus 
P. polymyxa may be an interesting model to study the role 
of topoisomerases in plasmid maintenance and transfer. It 
would also be relevant to screen Paenibacillus species for 
novel anti-topoisomerase inhibitors and/or new proteins 
interfering with topoisomerases. 
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